Enzyme function less conserved than anticipated.

Author

  • Burkhard Rost
Abstract

The level of sequence similarity that implies similarity in protein structure is well established. Recently, many groups proposed thresholds for similarity in sequence implying similarity in enzymatic function. All previous results suggest the strong conservation of enzymatic function above levels of 50% pairwise sequence identity. Here, I argue that all groups substantially overestimated the conservation of enzyme function because their data sets were either too biased or too small. An unbiased analysis suggested that less than 30% of the pair fragments above 50% sequence identity have entirely identical EC numbers. Another surprising finding was that even BLAST E-values below 10^-50 did not suffice to automatically transfer enzyme function without errors. As expected, most misclassifications originated from similarities in relatively short regions and/or from transferring annotations for different domains. Neither problem can be corrected easily by adjusting the thresholds for automatic transfer of genome annotations. A score relating sequence identity to alignment length (distance from HSSP-threshold) outperformed statistical BLAST scores for high sequence similarity. In particular, the distance score allowed error-free transfer of enzyme function for the 10% most similar enzyme pairs. The results illustrated how difficult it is to assess the conservation of protein function and to guarantee error-free genome annotations in general: sets with millions of pair comparisons might not suffice to arrive at statistically significant conclusions. In practice, the revised detailed estimates for the sequence conservation of enzyme function may provide important benchmarks for everyday sequence analysis and for more cautious automatic genome annotations.
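
The notion of "entirely identical EC numbers" used in the abstract can be illustrated with a minimal sketch. It assumes EC numbers are given as four dot-separated levels (for example "1.1.1.1") and that a dash marks an unassigned level. The function names `shared_ec_levels`, `entirely_identical`, and `fraction_identical` are hypothetical and are not taken from the paper.

```python
def shared_ec_levels(ec_a: str, ec_b: str) -> int:
    """Count the leading EC levels on which two EC numbers agree (0 to 4)."""
    levels_a = ec_a.strip().split(".")
    levels_b = ec_b.strip().split(".")
    shared = 0
    for a, b in zip(levels_a, levels_b):
        # Assumption: a dash marks an unassigned level and never counts as a match.
        if a != b or a == "-":
            break
        shared += 1
    return shared


def entirely_identical(ec_a: str, ec_b: str) -> bool:
    """True only when all four EC levels are assigned and equal."""
    return shared_ec_levels(ec_a, ec_b) == 4


def fraction_identical(pairs) -> float:
    """Fraction of (ec_a, ec_b) pairs whose EC numbers are entirely identical."""
    pairs = list(pairs)
    if not pairs:
        raise ValueError("no pairs given")
    hits = sum(entirely_identical(a, b) for a, b in pairs)
    return hits / len(pairs)
```

Applied to the enzyme pairs that pass a chosen similarity threshold, `fraction_identical` would give the kind of percentage quoted in the abstract; the pairs and the threshold themselves would have to come from the actual data set.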

Similar resources

Evaluation of Biochemical Composition and Enzyme Activities in Browned Arils of Pomegranate Fruits

Aril browning threatens production, consumption, and exports of pomegranates, because affected fruit cannot be externally distinguished from healthy fruit. This study compared the mineral, biochemical composition, and related enzyme activities in affected brown arils with healthy ones in ‘Malase Saveh’ pomegranates. The results indicated that concentrations of Cu in the aril and K, Mg, and Mn i...

The Role of Highly Conserved Tryptophan in the Sixth Conserved Region at Substrate Specificity of α-amylase

Early in this study, an α-amylase from Bacillus megaterium WHO (BMW) was isolated from hot springs of Ramsar (North of Iran), and its gene was cloned in E. coli. Based on its conserved sequence regions and substrate specificity, it was classified among the intermediary group of enzymes, with the specificity of the oligo-1,6-glucosidase and neopullulanase subfamilies. In the sixth conserved re...

Conserved and non-conserved residues and their role in the structure and function of p-hydroxybenzoate hydroxylase.

In order to elucidate the molecular mechanism of the catalytic reaction and enzyme conformation, we substituted 53 conserved residues identified by aligning 92 p-hydroxybenzoate hydroxylase sequences and 19 non-conserved residues selected from crystallographic studies of Pseudomonas fluorescens NBRC14160 p-hydroxybenzoate hydroxylase with 19 other naturally occurring amino acids, yielding a dat...

CLONING AND EXPRESSION OF LEISHMANOLYSIN GENE FROM LEISHMANIA MAJOR IN PRIMATE CELL LINES

Leishmaniasis is a worldwide disease that is caused by different species of the genus Leishmania. Leishmanolysin, one of the genes expressed by Leishmania, appears to be an ideal candidate for genetic vaccination. In this study, a full-length sequence, which encodes the functionally critical regions of Leishmanolysin (amino acids 100-579), was cloned from a Leishmania strain endemic to Iran. Analysis...

Zinc-dependent cell growth conferred by mutant tRNA synthetase.

We present evidence that zinc bound near the C terminus of a long tRNA synthetase polypeptide, and at a location far in the sequence from the catalytic domain, is needed to sustain cell growth and is, therefore, essential for enzyme function. Several class I and class II tRNA synthetases contain bound zinc, including the 939-amino acid class I Escherichia coli isoleucyl-tRNA synthetase, which h...

Journal:
  • Journal of molecular biology

Volume 318, Issue 2

Pages: -

Publication date: 2002